#LLM Agent

Research

DCA-Bench: A Benchmark for Dataset Curation Agents
Benhao Huang, 
Yingzhuo Yu, 
Jin Huang, 
Xingjian Zhang, 
Jiaqi W. Ma
May 1st 2025
KDD-2025 DB Track (Oral), ICML-2025 Data World
#LLM Agent
#Benchmark
#Data-centric AI

A benchmark exploring the performance of LLM Agents on detecting issues in datasets hosted on popular platforms.

paper
Github
🤗HuggingFace
slides
poster
Last Updated on Aug 10th 2025 Powered by greatest-gatsby-academic-template.